Task-based End-to-end Model Learning in Stochastic Optimization

Authors

  • Priya L. Donti
  • J. Zico Kolter
  • Brandon Amos
Abstract

In practice, we use sequential quadratic programming (SQP) to solve (*), finding z⋆(x; θ) via a solution for fast argmin differentiation in QPs [3], and then take derivatives through the quadratic approximation at this optimum. Technical challenge: argmin differentiation. We outperform both traditional model learning and model-free policy optimization in terms of task cost, the objective of actual interest in the closed-loop system.
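The idea of differentiating through an argmin can be illustrated on a much simpler problem than the one in the abstract. The following is a minimal sketch, not the authors' implementation and not the solver of [3]: it assumes an equality-constrained QP, minimize (1/2) zᵀQz + pᵀz subject to Az = b, where only the linear term p depends on the model. The function names, the toy data, and the squared-error task loss are all hypothetical. Because the optimum satisfies a linear KKT system, the gradient of a downstream loss with respect to p can be obtained from one extra linear solve with the same KKT matrix.

```python
import numpy as np

def solve_eq_qp(Q, p, A, b):
    """Solve min 0.5 z^T Q z + p^T z  s.t.  A z = b via its KKT system.

    Returns the optimum z* and the KKT matrix K (reused for the gradient).
    Assumes Q is positive definite and A has full row rank.
    """
    n, m = Q.shape[0], A.shape[0]
    K = np.block([[Q, A.T], [A, np.zeros((m, m))]])
    sol = np.linalg.solve(K, np.concatenate([-p, b]))
    return sol[:n], K

def grad_wrt_p(K, grad_z, n):
    """Gradient of a scalar loss w.r.t. p, given grad_z = dLoss/dz at z*.

    Differentiating K [z; nu] = [-p; b] gives dz/dp = -[I 0] K^{-1} [I; 0];
    K is symmetric, so the gradient needs a single solve against K.
    """
    rhs = np.concatenate([grad_z, np.zeros(K.shape[0] - n)])
    d = np.linalg.solve(K, rhs)
    return -d[:n]

# Hypothetical toy problem (all values are illustrative assumptions).
Q = np.array([[2.0, 0.5], [0.5, 1.0]])
A = np.array([[1.0, 1.0]])
b = np.array([1.0])
p = np.array([0.3, -0.2])
target = np.array([0.2, 0.8])

def task_loss(p_vec):
    """Hypothetical task loss: squared distance of z*(p) from a target."""
    z, _ = solve_eq_qp(Q, p_vec, A, b)
    return 0.5 * np.sum((z - target) ** 2)

z_star, K = solve_eq_qp(Q, p, A, b)
analytic = grad_wrt_p(K, z_star - target, n=2)
print("z* =", z_star)
print("dLoss/dp =", analytic)
```

Inequality constraints and non-quadratic objectives, which the SQP approach in the abstract is meant to handle, are outside the scope of this sketch.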

Similar resources

Cycle Time Optimization of Processes Using an Entropy-Based Learning for Task Allocation

Cycle time optimization could be one of the great challenges in business process management. Although there is much research on this subject, task similarities have been paid little attention. In this paper, a new approach is proposed to optimize cycle time by minimizing entropy of work lists in resource allocation while keeping workloads balanced. The idea of the entropy of work lists comes fr...

Task-based End-to-end Model Learning

As machine learning techniques have become more ubiquitous, it has become common to see machine learning prediction algorithms operating within some larger process. However, the criteria by which we train machine learning algorithms often differ from the ultimate criteria on which we evaluate them. This paper proposes an end-to-end approach for learning probabilistic machine learning models wit...

An integrated vendor–buyer model with stochastic demand, lot-size dependent lead-time and learning in production

In this article, an imperfect vendor–buyer inventory system with stochastic demand, process quality control and learning in production is investigated. It is assumed that there are learning in production and investment for process quality improvement at the vendor’s end, and lot-size dependent lead-time at the buyer’s end. The lead-time for the first batch and those for the rest of the batches ...

Effects of Task-based Academic Listening on High School EFL Students' Listening Comprehension: Does Experiential Learning Style Matter?

Task-based language teaching (TBLT) has been considered as an effective language teaching methodology. However, its applicability for lower-proficiency learners in EFL contexts has not been adequately justified. Moreover, the possible mediating effect of the experiential learning styles on academic listening TBLT has not been targeted in the literature, a gap that this study attempts to fill. T...

End-to-End Optimization of Task-Oriented Dialogue Model with Deep Reinforcement Learning

In this paper, we present a neural network based task-oriented dialogue system that can be optimized end-to-end with deep reinforcement learning (RL). The system is able to track dialogue state, interface with knowledge bases, and incorporate query results into agent’s responses to successfully complete task-oriented dialogues. Dialogue policy learning is conducted with a hybrid supervised and ...

Publication date: 2017